Discovering Implicit Entity Relation with the Gene-Citation-Gene Network

نویسندگان

  • Min Song
  • Nam-Gi Han
  • Yong-Hwan Kim
  • Ying Ding
  • Tamy Chambers
چکیده

In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG) network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG) network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discovering implicit associations among critical biological entities

We propose an approach to predicting implicit gene-disease associations based on the inference network, whereby genes and diseases are represented as nodes and are connected via two types of intermediate nodes: gene functions and phenotypes. To estimate the probabilities involved in the model, two learning schemes are compared; one baseline using co-annotations of keywords and the other taking ...

متن کامل

Gene Regulation Network Based Analysis Associated with TGF-beta Stimulation in Lung Adenocarcinoma Cells

Background: Transforming growth factor (TGF)-β is over-expressed in a wide variety of cancers such as lung adenocarcinoma. TGF-β plays a major role in cancer progression through regulating cancer cell proliferation and remodeling of the tumor micro-environment. However, it is still a great challenge to explain the phenotypic effects caused by TGF-β stimulation and the effect of TGF-β stimulatio...

متن کامل

Investigating the Relation between LCK Gene Expression with Type 2 Diabetes Patients in Yazd Diabetes Research Center

Type 2 diabetes mellitus (T2DM) is characterized by insulin resistance and insulin secretory defect. Deficiency of cellular immunity is known as one of the factors involved in the pathogenesis of T2DM. lymphocyte-specific protein tyrosine kinase( LCK) is an important gene involved in the intracellular signaling pathways of lymphocytes. This study aimed at determining and comparing LCK gene expr...

متن کامل

Study and detection of ERG11 gene mutation in resistant - drug Candida albicans and relation with Iranian women infertility

Aim and Background: Candida  Albicans is the most important cause of  vulvovaginal candidiasis and resistance to azoles can occur via various mechanisms including, change in the ERG11 gene. This study aim was to identification  of  ERG11 gene mutations in drug  resistant Candida albicans isolated from patients with Candida vaginitis and its association with infertility in Iranian women.   Mat...

متن کامل

Exploring Gene Signatures in Different Molecular Subtypes of Gastric Cancer (MSS/ TP53+, MSS/TP53-): A Network-based and Machine Learning Approach

Gastric cancer (GC) is one of the leading causes of cancer mortality, worldwide. Molecular understanding of GC’s different subtypes is still dismal and it is necessary to develop new subtype-specific diagnostic and therapeutic approaches. Therefore developing comprehensive research in this area is demanding to have a deeper insight into molecular processes, underlying these subtypes. In this st...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013